Tagging Funding Agencies and Grants in Scientific Articles using Sequential Learning Models

نویسندگان

  • Subhradeep Kayal
  • Zubair Afzal
  • George Tsatsaronis
  • Sophia Katrenko
  • Pascal Coupet
  • Marius A. Doornenbal
  • Michelle Gregory
چکیده

In this paper we present a solution for tagging funding bodies and grants in scientific articles using a combination of trained sequential learning models, namely conditional random fields (CRF), hidden markov models (HMM) and maximum entropy models (MaxEnt), on a benchmark set created in-house. We apply the trained models to address the BioASQ challenge 5c, which is a newly introduced task that aims to solve the problem of funding information extraction from scientific articles. Results in the dry-run data set of BioASQ task 5c show that the suggested approach can achieve a micro-recall of more than 85% in tagging both funding bodies and grants.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Big Science vs. Little Science: How Scientific Impact Scales with Funding

AGENCIES THAT FUND SCIENTIFIC RESEARCH MUST CHOOSE is it more effective to give large grants to a few elite researchers, or small grants to many researchers? Large grants would be more effective only if scientific impact increases as an accelerating function of grant size. Here, we examine the scientific impact of individual university-based researchers in three disciplines funded by the Natura...

متن کامل

Machine Learning Techniques for Automatically Extracting Contextual Information from Scientific Publications

Scholarly publishing increasingly requires automated systems that semantically enrich documents in order to support management and quality assessment of scientific output. However, contextual information, such as the authors’ affiliations, references, and funding agencies, is typically hidden within PDF files. To access this information we have developed a processing pipeline that analyses the ...

متن کامل

Metrics associated with NIH funding: a high-level view

OBJECTIVE To introduce the availability of grant-to-article linkage data associated with National Institutes of Health (NIH) grants and to perform a high-level analysis of the publication outputs and impacts associated with those grants. DESIGN Articles were linked to the grants they acknowledge using the grant acknowledgment strings in PubMed using a parsing and matching process as embodied ...

متن کامل

Some Aspects of the State-of-the-Arts in Biomedical Science Research: A Perspective for Organizational Change in African Academia.

In the biomedical sciences, there is need to generate solutions for Africa's health and economic problems through the impact of university research. To guide organizational transformation, the author here presents some aspects of the state-of-the-arts of biomedical science research in advanced countries using a perspective derived from the FASEB journal publications. The author examines the thi...

متن کامل

Collective allocation of science funding: from funding agencies to scientific agency

Public agencies like the U.S. National Science Foundation (NSF) and the National Institutes of Health (NIH) award tens of billions of dollars in annual science funding. How can this money be distributed as efficiently as possible to best promote scientific innovation and productivity? The present system relies primarily on peer review of project proposals. In 2010 alone, NSF convened more than ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017